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PROBLEM 

To  investigate  certain  properties  of  stochastic  attrition  processes— spe¬ 
cifically  to  formulate  the  general  problem  of  the  interaction  of  two  groups  of 
combatants  when  chance  plays  a  role,  to  show  what  data  must  be  given  before 
the  problem  can  be  solved,  to  determine  explicitly  a  complete  solution  ol  the 
general  problem,  to  indicate  a  method  for  surmounting  the  practical  difficulties 
encountered,  and  to  carry  out  this  method  by  analyzing  the  probability  that  a 
given  side  will  win. 


FACTS 

F.  W.  Lanchester  published1  an  analysis  of  the  effects  of  concentration  of 
firepower  in  combat.  He  consideredtwo  groups  of  forces  opposing  one  another, 
assuming  that  the  rate  of  attrition  for  each  force  depended  on  the  strength  of  the 
opposition.  He  thus  obtained  a  system  of  differential  equations  describing  the 
behavior  of  the  two  forces. 

During  World  War  II  there  was  a  revival  of  interest  in  this  theory  of  combat, 
though  Lanchester’s  strictly  deterministic  treatment  of  the  problem  was  mod¬ 
ified  to  include  the  random  fluctuations,  which  inevitably  occur.  Fundamental 
work  in  formulation  of  the  problem  in  a  stocnastic  framework  was  carried  out 
by  the  Operations  Research  Group  in  the  Department  of  the  Navy;  such  formu¬ 
lations  are  now  being  used  at  ORO  in  combat  models. 


DISCUSSION 

In  applying  a  stochastic  analysis  to  Lanchester’s  theory  of  combat  this 
memorandum  considers  a  certain  Markov  stochastic  process  characterized  by 
the  initial  state  and  the  two  functions  <j>  and  a.  The  probability  that  the  system 
remains  in  a  given  state  (m,n)  throughout  a  time  interval  of  length  t  isc~^(m’n). 
Alternatively  it  where  r{m,n)  is  the  expected  duration  of  the  sys¬ 

tem  in  the  state  (m,K)  and  r(m,n)  =  l/0(m,n).  The  probability  that,  leaving  the 
state  (m,rt),the  system  goes  to  the  state  (m,n  - 1)  is  a(m,n);  the  probability  that, 
leaving  the  state  (m,n),  it  goes  to  tjre  state  (m  -l,n)  is  1  -a(m,n),  which  is  also 
called  (3(m,n).  To  find  p(a,b,t;  m,«)  is  to  determine  the  probability  that,  starting 
at  (m,n),  the  system  will  be  at  (a  ,b )  afterthe  elapsed  time  t.  Winning  means  the 
complete  annihilation  of  the  opposing  side;  the  probability  that  the  first  side 
wins  from  state  (m,n)  is  denoted  by  P(m,n).  This  probability  satisfies  the  dif¬ 
ference  equation  P(m,n)  =a(m,n)  P(m,n  -  l)  +  /3(m,n)  P(m-l,n). 
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CONCLUSIONS 

1.  The  initial  state,  the  expected  duration  in  each  possible  subsequent 
state, and  the  chances  of  the  transition  from  each  state  to  the  two  possible  im¬ 
mediately  succeeding  states  completely  determine  the  stochastic  process. 

2.  Exact  expression  for  the  probability  that  a  given  side  wins  is  too  com¬ 
plicated  to  be  of  practical  use,  but  it  is  demonstrated  that  it  is  possible  to  find 
a  simple  useful  approximation. 

3.  The  two  functions  0  and  a,  which  characterize  the  stochastic  process, 
may  be  expressed  in  terms  of  more  readily  accessible  data,  n?"n*’y,  the  rate 
of  fire  and  the  single -shot  kill  probability,  using  the  basic  assumptions  from 
Lanchester's  original  work. 
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A  STOCHASTIC  ANALYSIS  OF  LANCHESTER’S  THEORY  OF  COMBAT 


INTRODUCTION 


An  analysis  of  the  effects  of  concentration  of  firepower  in  combat  was  made 
by  F.  W.  Lanchester  in  1916.  He  presented  a  system  of  differential  equations  de¬ 
termining  the  behavior  of  two  groups  of  forces  opposing  one  another,  assuming 
for  each  that  the  rate  of  attrition  depended  on  the  strength  of  the  opposition. 
Lanchester’s  theory  of  combat  was  deterministic;  his  differential  equations  im¬ 
plied  that  the  future  was  completely  determined.  During  World  War  II,  however, 
a  more  realistic  approach  to  the  problem  was  undertaken,  and  a  theory  allowing 
for  chance  fluctuations  was  created.  It  is  this  stochastic  theory  which  is  studied 
here. 

In  this  paper  an  attempt  is  made  to  perform  three  services  for  persons 
working  in  this  field  at  the  present  time: 

(a)  To  provide  a  general  formulation  of  the  problem,  in  terms  of  which  the 
various  specialized  results  of  other  groups  of  investigators  can  be  interpreted. 
This  is  done  in  the  first  section. 

(b)  To  exhibit  a  complete  solution  of  the  general  problem  as  thus  formu¬ 
lated.  This  is  done  in  the  second  section.  Although  the  solution  is  complete  it 
is,  in  its  general  form,  of  little  practical  use.  In  various  special  cases,  how¬ 
ever,  this  solution  is  useful  in  studying  certain  properties  of  the  solution.  These 
will  readily  occur  to  the  worker  in  the  field. 

(c)  To  indicate  a  method  by  which  the  practical  difficulties  encountered  in 
studying  the  general  solution  can  be  surmounted.  This  is  done  in  the  third  sec¬ 
tion,  largely  by  exhibiting  the  method  as  applied  to  a  specific  aspect  of  the  gen¬ 
eral  problem.  Although  the  problem  studied  in  the  third  section  is  quite  special, 
it  seems  clear  that  the  asymptotic  methods  used  there  can  be  extended  to  other 
problems  arising  from  the  solution  given  in  the  second  section.  Work  along  these 
lines  is  now  in  progress. 

To  obtain  the  desired  generality  the  formulation  of  the  problem  in  the  first 
section  is  necessarily  abstract.  It  is  proposed  to  give  here  a  concrete  formula¬ 
tion  of  the  problem  so  that  some  meaning  can  be  attached  to  the  ideas  in  the  first 
section. 

Consider  two  opposing  forces  called  the  “first  side”  and  the  “second  side.” 
The  forces  try  to  destroy  one  another;  hence,  as  time  goes  on,  the  strengths  of 
the  two  sides  diminish.  Letx(t)  denote  the  number  of  survivors  on  the  first  side 
at  time  t ,  and  let  y  (t)  denote  the  number  of  survivors  on  the  second  side  at  time 
t .  The  fundamental  point  of  view  in  this  paper  is  that  x(t)  and  y(t)  are  random 
variables:  they  are  not  completely  determined  beforehand,  but  the  probability 
that  x(t)  and  y(t)  have  given  values  is.  Specifically,  the  question  is  “What  is  the 
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probability  p  (a  ,b,t;  m  ,n)  that  there  will  be  a  and  b  survivors  on  the  first  and 
second  sides,  respectively,  at  time  t,  if  there  were  initially  m  and  n  members 
on  the  two  sides?” 

The  first  section  is  concerned  with  the  precise  formulation  of  this  problem, 
in  addition  to  a  discussion  of  the  parameters  that  serve  to  characterize  the  ef¬ 
fectiveness  of  each  side  ir.  combat.  For  completeness  the  differential-difference 
equations  for  the  problem,  used  by  many  workers  in  stochastic  processes,  are 
also  derived.  The  asymptotic  devices  used  in  the  third  section,  however,  are  not 
based  on  these  equations. 

In  the  second  section  p(a,b,t;  m,n)  is  completely  determined.  It  is  exhibited 
however  as  an  integral  in  which  the  complicated  structure  of  the  integrand  tends 
to  obscure  the  understanding  of  the  precise  nature  of  the  solution.  This  complex¬ 
ity  arises  mainly  from  certain  combinatorial  difficulties  associated  with  “paths,” 
defined  in  the  first  section  and  examined  more  fully  in  the  second.  The  situation 
is  somewhat  analogous  to  the  situation  in  certain  one-dimensional  stochastic 
processes,  where  the  exact  solution  is  apparently  quite  useless. 

If  a  useful  solution  to  the  problem  of  finding  p(a,b,t;  m,n)  cannot  be  given, 
then  an  effort  can  be  made  to  answer  related  questions.  One  question  of  evident 
interest  is  this:  “What  is  the  probability  that  the  first  side  will  win?”  Here 
winning  means  the  destruction  of  the  opposing  force  before  a  given  side  is  anni¬ 
hilated.  Again,  a  precise  answer  is  complicated  beyond  usefulness,  but  a  useful 
approximate  answer  can  be  given.  In  the  third  section  this  approximate  answer 
is  found.  Comparison  of  the  exact  solution  and  the  approximation  in  certain 
numerical  examples  shows  that  the  agreement  is  surprisingly  good.  These  ex¬ 
amples  are  given  in  the  last  section. 

It  must  be  admitted  that  many  problems  remain  to  be  solved  before  a  co¬ 
herent  stochastic  theory  of  combat  is  evolved.  This  memorandum  claims  only 
to  frame  the  problem  in  a  precise  way,  then  to  formulate  an  exact  solution  of 
this  problem  (to  spare  future  workers  the  labor  of  writing  out  this  solution),  and 
finally  to  present  a  useful  approximation  to  the  probability  that  one  side  will 
win.  Among  the  many  other  problems  that  remain,  the  three  following  are  of 
special  interest: 

(a)  .What  is  a  useful  approximation  for  p(a,b,t;  m,n)? 

(b)  Find  an  approximation  for  the  expected  number  of  survivors  on  each 
side  in  terms  of  time.  How  close  is  this  to  the  solution  of  the  classical  deter¬ 
ministic  Lanchester  equations? 

(c)  How,  if  at  all,  can  the  central  limit  theorems  of  Bernstein  and  LoWe 
for  dependent  probabilities  be  used  here  to  simplify  the  derivation  of  asymptotic 
formulas? 

It  is  hoped  that  this  memorandum  may  help  to  stimulate  work  on  these  re¬ 
lated  problems. 
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STATEMENT  OF  THE  PROBLEM 


Consider  a  physical  system,  the  state  of  which  at  any  instant  t  is  described 
by  an  ordered  pair  of  nonnegative  integers  [x (t ) ,y (t )].  The  system  is  to  be  ob¬ 
served,  starting  at  some  convenient  time  which  will  be  taken  as  zero.  The  actual 
behavior  of  the  system  is  thus  completely  prescribed  by  two  functions,  x  and  y , 
defined  on  the  set  of  all  nonnegative  real  numbers,  with  values  in  the  set  of  non¬ 
negative  integers.  The  possible  behavior  of  the  system  is  circumscribed  by  the 
following  two  assumptions:  first,  that  both  x  and  y  are  nonincreasing  functions, 
i.e.,  if  t’>t ,  then  x (t ' )  <  x(t)  and  y(t')  =?y(t);  and,  second,  that  when  the  system 
leaves  any  state  ( a,b) ,  it  must  proceed  to  the  state  (a-l,b)  or  the  state  (a,b-l). 


The  set  of  all  possible  modes  of  behavior  of  this  system  may  thus  be  identi¬ 
fied  with  the  set  of  all  ordered  pairs  of  nonincreasing,  nonnegative,  integral¬ 
valued  functions,  defined  on  the  set  of  all  nonnegative  real  numbers,  such  that 
both  functions  in  the  ordered  pair  are  not  discontinuous  at  the  same  number,  and 
such  that  at  a  point  where  it  is  discontinuous,  a  function  decreases  by  one  unit. 
The  set  of  all  ordered  pairs  of  such  functions  will  be  denoted  by  S.  For  a  further 
description  of  the  sort  of  situation  envisioned  here,  the  reader  is  referred  to  work 
by  Koopman,  cited  in  Morse  and  Kimball.2 

An  element  of  the  set  S  is  an  ordered  pair  of  functions.  An  example  of  the 
graphs  of  such  an  ordered  pair  of  functions  might  look  like  Fig.  1. 
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An  alternative  (and  not  so  informative)  method  of  representing  this  behavior 
graphically  is  to  show  the  path  followed  by  the  system,  disregarding  the  time  of 
transition  from  one  state  to  another.  The  path  diagram  which  would  correspond 
to  the  situation  shown  in  Fig.  1  is  given  in  Fig.  2. 

It  should  be  evident  that  each  element  of  S  (i.e.,  each  ordered  pair  of  func¬ 
tions  of  the  type  being  considered)  determines  uniquely  a  graph  in  the  fashion  of 
Fig.  2.  On  the  other  hand,  to  each  graph  of  the  type  shown  in  Fig.  2,  infinitely 
many  elements  of  the  set  S  will  generally  correspond  (or,  equivalently,  infinitely 
many  graphs  like  Fig.  1). 


X 


Fig.  2— The  Path  for  Fig.  1. 


Thus  far  the  discussion  has  centered  on  the  possible  representation  of  the 
behavior  of  the  system.  It  is,  of  course,  a  much  more  fundamental  problem  to 
discuss  the  mechanism  leading  to  a  particular  mode  of  behavior  for  the  system. 
It  might  be,  for  example,  that  the  first  coordinate  of  the  system  decreases  by 
one  unit  at  a  quarter  past  and  a  quarter  to  every  hour,  but  the  second  coordinate 
decreases  by  one  unit  every  hour  on  the  hour.  In  such  a  case,  only  the  initial 
state  of  the  system  need  be  known  in  order  that  the  state  of  the  system  be  known 
at  any  subsequent  time.  Under  these  conditions  the  process  is  said  to  be  deter¬ 
ministic.  The  process  considered  in  Morse  and  Kimball3  is  deter  mini  sti  c” 

This  is  just  the  situation  that  will  not  be  considered  here.  Instead  it  will 
be  assumed  that  chance  affects  the  system,  so  that  when  the  initial  state  of  the 
system  is  known,  only  the  probability  that  the  system  will  be  in  a  certain  state 
at  a  subsequent  time  is  known.  Such  a  process  is  said  to  be  “stochastic.”  Morse 
and  Kimball  also  consider  a  stochastic  process.4  For  a  more  detailed  but  more 
technical  description  of  stochastic  processes  in  general,  the  reader  is  referred 
to  Doob.s 

As  a  generalization  of  the  situation  in  a  great  many  practical  problems 
the  following  assumptions  are  made  for  the  stochastic  process  being  considered 
here:  first,  that  it  is  a  Markov  process,  and,  second,  that  it  is  a  stationary 
process.  For  the  technical  definitions  of  a  Markov  process  and  a  stationary 
process,  see  Doob.8  Both  terms  will  be  used  here  “in  the  strict  sense.”  An 
intuitive  interpretation  of  these  assumptions  follows. 
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The  assumption  that  the  process  is  a  Markov  process  can  be  expressed 
intuitively  by  saying  that,  from  any  instant  onward,  the  behavior  of  the  system 
may  depend  on  the  state  of  the  system  at  that  instant,  but  it  certainly  does  not 
depend  on  the  previous  history  of  the  system.  Thus,  if  0  <  <  t2  <  .  .  .  <  tn  <  , 
the  probability  that  the  system  is  in  state  (a,b)  at  the  instant  t,  given  that  it  was 
in  state  (a^bj)  at  ,  (a2,b2)  at  t2 ,  .  .  .,  and  (an,bn)  at  tn  ,  depends  only  on  a,  b,  £, 
an>  kn.  and  t„ ,  not  on  the  states  at  the  earlier  instants.  Consequently  only  prob¬ 
abilities  of  the  form  p (c,d,t;  a,b,s )  need  be  considered,  which  denote  the  prob¬ 
ability  that  the  system  is  in  state  (c,d)  at  the  instant  t ,  given  that  the  system  is 
in  state  (a,b)  at  the  instant  s,  with  s  <£. 

The  second  assumption  (that  the  process  is  stationary)  introduces  a  fur¬ 
ther  simplification.  It  means  that  what  happens  during  a  given  time  interval 
depends  only  on  the  state  of  the  system  at  the  beginning  of  the  interval,  and  on 
the  length  of  the  time  interval— not  on  the  instant  at  which  the  time  interval  be¬ 
gins.  Consequently  p (c,d,s  +  t  ;  a,b,s)  =  p (c,d,t ;  a  ,b,0),  which  might  as  well  be 
(and  which  will  be)  denoted  by  p(c,d,t;  a, b). 

In  summary  this  investigation  is  to  be  concerned  with  the  probabilities 
P (c,d,t;  a, b),  which  denote  the  probability  that,  if  the  system  is  at  any  instant 
in  the  state  (n,b),  then  it  will  be  in  the  state  (c,d)  after  a  time  interval  of 
length  t . 

Certain  properties  of  p( c, d, t ;  a,b),  which  are  consequences  of  the  preced¬ 
ing  assumptions,  will  now  be  derived. 

Suppose  now  that  the  system  is  initially  in  a  state  (m,n).  The  probability 
that  it  will  proceed  to  a  state  (a,b)  during  an  interval  of  time  of  length  s  is 
p(a,b,s;  m,n),  and  the  probability  that  it  will  then  proceed  to  a  state  (c  ,d )  during 
a  succeeding  interval  of  length  t  is  p(  c,d,t;  a,b),  since  the  system  is  in  state 
(a,b)  at  the  beginning  of  this  second  interval.  Thus  the  probability  that,  if  the 
system  is  initially  at  (m,n),  it  proceeds  to  (a,b)  during  the  interval  of  length  s 
and  thence  to  (c  ,d)  during  the  succeeding  interval  of  length  t  is  p(a,b,s;  m,n)  • 
p(c,d,t;  a, b). 

Consider  next  the  probability  that,  if  the  system  is  initially  at  (m,n),  it  proceeds 
to  (c  ,d)  during  an  interval  of  length  s  +t .  It  is,  of  course,  denoted  by  p  (c,d,s  +£  ; 
m,n).  But,  this  can  also  be  expressed  as  a  sum  of  products  of  the  type  just  con¬ 
sidered.  For  at  the  end  of  the  first  interval  of  length  s  the  system  must  be  in 
some  state  between  (m,n)  and  (c,d),  inclusive.  Since  it  can  be  in  only  one  state 
at  a  given  instant,  the  various  possibilities  are  mutually  exclusive.  Hence, 
p(c,d,s+t;  m,n)  is  the  sum  of  all  the  probabilities  of  the  compound  events  de¬ 
scribed  in  the  preceding  paragraph;  i.e. 

m  n 

p(c,d,  s  +  t;  m,n)  =  £  Z]  p(o,b,s;  m,n)  p{c,d,t;  a,b).  (1) 

o=c  b=d 

Equation  1,  which  will  be  found  to  be  fundamental  in  the  subsequent  work,  is 
essentially  a  special  case  of  Feller’s7  Eq.  9.2  or  of  Doob’s8  Eq.  1.2'. 

A  special  case  of  Eq.  1  arisec  if  c  =m  and  d  =  n.  In  this  case 

p(m,n,  s  +  t;  m,n)  =p(m,n,s;  m,n)  p(m,n,t ;  m,n). 

[The  appropriate  verbal  interpretation  of  p(m,n,t;  rn.n)  is  the  probability  that 
the  system  remains  in  the  state  ( m,n)  throughout  a  time  interval  of  length  t , 
if  the  system  is  in  this  state  as  the  beginning  of  the  time  interval.] 
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ecause  p  is  a  probability  function,  it  follows  from  the  general  theory  of 
stochastic  processes  that 

p{m,n,t;  m,n)  =  f  (2) 

where  </.•  m,n)  is  nonnegative.  Indications  of  the  method  of  proof  of  this  result 
are  fou  >d  in  Doob. 

Ir  certain  (unnecessary)  assumptions  are  made  about  the  function  p ,  a 
simple  proof  of  Eq.  2  can  be  given.  The  special  case  of  Eq.  1  under  considera¬ 
tion  is  an  equation  of  the  form 

/(*+«)-/(*)  /(<). 

Differentiate  with  respect  to  s;  then 

/'(*+<) -/'(*) /(i). 

Now  set  s  equal  to  zero,  obtaining 

/'(«)  =  C  /(£), 

where  c  =f'(0).  This  is  a  differential  equation  whose  solutions  are  of  the  form 

/(0=aeef, 

where  a  is  an  arbitrary  constant.  Now  in  the  special  case  of  Eq.  1,  which  is  of 
interest  here,  the  condition  that  p(tti,n,0;  tn,n)  should  equal  one  requires  that  a 
be  one,  and  the  condition  that  p  (m,n,t ;  m,n)  be  less  than  one  for  positive  I  re¬ 
quires  that  c  in  the  exponent  be  negative.  This  establishes  Eq.  2. 

Of  course,  this  proof*  has  assumed  that  all  the  functions  encountered 
possess  derivatives.  This  is  an  unnecessary  assumption,  as  the  work  in  Doob 
shows. 

Because  of  the  importance  of  the  facts  expressed  by  Eq.  2,  the  equation  is 
given  the  following  verbal  interpretation:  The  probability  that  the  system  does 
not  change  from  the  state  ( m,n)  during  an  interval  of  time  of  length  t  is 

Consider  next  the  variate  (or  “random  variable”)  Tm  whose  value  is  the 
length  of  time  during  which  the  system  remains  in  the  state  (m,n). 

Let  F  denote  the  (cumulative)  distribution  function  for  Tm  n  ;  i.e.,  F(«)  is 
the  probability  that  the  observed  value  of  the  variate  Tm  „  is  less  than  or  equal 
tou.  Then,  for  any  positive  number  u, 

F(u)  =  Pr  jrm>n  <«} 

=  Pr  {the  system  is  no  longer  in  ( m,n)  after  a  time  u} 

=  1  -  Pr  {no  change  occurs  during  the  time  u} 

=  1  - 

while  F(u)  =  0  if  u  is  zero,  or  negative.  Obviously,  F'(u)=  0,  if  u<0,  and  F'(u)  = 
<t>(m,n)  ,ifu>o. 

The  expected  value  of  Tm  n ,  which  will  be  denoted  by  T(m,n),  is  given  by 
the  following  procedure: 

r(m,n)  =  /  udF(u)  =  f  u$(m,n)  e-»<M  du  = 

—on  0 

Thus  <f>(m,n)  can  be  interpreted  as  the  reciprocal  of  the  average  duration  of 
the  system  in  the  state  ( m,n).  Perhaps  it  should  be  stated  explicitly  that  the 
symbol  t(  m,n)  will  be  used  throughout  the  sequel  to  mean  the  expected  value  of 

T m,n  • 
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Up  to  this  point  the  discussion  appears  to  be  concerned  principally  with 
the  probability  that  the  system  does  not  undergo  any  changes  of  state  during  a 
given  interval  of  time.  The  next  part  of  the  discussion  is  primarily  concerned 
with  establishing  the  notation  necessary  for  considering  what  happens  when  the 
system  does  change  its  state. 

Suppose  the  system  is  in  a  certain  state  (m,n).  By  the  second  of  the  funda¬ 
mental  assumptions  made  at  the  beginning  of  this  section,  the  system  can  pro¬ 
ceed  only  to  states  (m-l,n)  and  (m,n-l)  when  it  leaves  the  state  (m,n).  Let 
a(m,n)  denote  the  probability  that  the  system,  on  leaving  the  state  (m,n),  proceeds 
to  the  state  (m,n-  1),  and  let  0(m,n)  denote  the  probability  that  the  system,  on 
leaving  the  state  (m,n),  proceeds  to  the  state  (m  -  l,n).  Of  course,  0(m ,n)  = 

1  -a(m,n). 

It  may  now  be  intuitively  evident  that  if  the  initial  state  of  the  system  (cor¬ 
responding  to  t  =0),  and  the  two  functions  0  and  a  are  given,  then  all  the  prob¬ 
abilities  p  (a , b, t ;  m,n)  are  completely  determined.  In  any  event,  this  is  the  case, 
as  the  following  discussion  will  show. 

The  principal  objective  now  is  to  derive  two  differential-difference  equa¬ 
tions  that  must  be  satisfied  by  p(a,b,t;  m,n).  Each  such  equation  is  based  on 
Eq.  1. 

First  Equation.  Differentiate  Eq.  1  with  respect  to  s,  and  then  set  s  equal 
to  zero.  The  result  is 

m  n 

P3(c,<l,f,  m,n)  =  21  Z1  P3M,0;  "i.n)  p(c,d,t\  a,b), 
a=c  bt-d 


in  which  the  subscript  3  denotes  the  first-order  partial  derivative  with  respect 
to  the  third  argument;  i.e., 

p3(c,rf,t;  m,n)  =  lim  [p{c,d,  t  +  6;  m,n )  -  p'e.rf.t;  m,n)]/h. 
h-t  0 


Second  Equation.  Differentiate  Eq.  1  with  respect  to  t  ,  and  then  set  t 
equal  to  zero.  The  result  is 

m  n 

p3(c,d,s;  m,n)  ~  H  P^a'^’s’  m>n'>  P-^c.d.O;  a,b), 
a  =  c  b*d 

Now  it  turns  out  that  most  of  the  terms  that  appear  in  the  sums  in  these 
two  equations  are  zero.  It  is  clear  from  Eq.  2  that  p3(a,b,0;  a,b)  = -0(o,b).  It 
can  be  shown  by  the  general  theory,  or  directly  by  quite  simple  arguments, that 

p3(o,  6-1,0;  a,b)  =  a  (a,  6)  <£(<i,6) 
p3(a  -  1,  6,  0;  a, 6)  =  /)(a,6)  cj>(a,b), 


and  that  all  the  other  derivatives  are  zero.  The  general  theory  is  given  in 
Doob;  0  an  alternative  treatment,  with  less  generality,  is  contained  in  Feller.11 

If,  in  the  second  equation,  the  -ariable  s  is  replaced  by  t,  the  two  equations 
can  be  written  now  as 


{d/dt)  p(a,b,t;  m,n)  =  -  0(m,n)  p(a,6,t;  m,n) 


+  tx(m,n)  <6(m,n)  p(a,b,t ;  m,  n  -  1)  +  0{m,n)  </>(m,n)  p{a,b,t ;  m  -  l,n) 


and 


{d/dt)  p(a,b,t;  m,n)  =  ~-(/){a,b)  p(a,b,c;  m,n) 


(3) 


(4) 


+  a(a,  6  +  1)  0(a,  6  +  1)  p(a,  6  +  1,  t;  m,n)  +  )](a+l,6)  0(a+  1,  6)  p(a+  1,6,1;  m,n). 
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Clearly  the  solutions  of  these  equations  will  depend  only  on  the  coefficients 
which  are  determined  by  0  and  a  (since  /3  =  1  -a),  and  on  the  initial  state  (m,n).  ’ 

Hence,  the  validity  of  the  earlier  assertion  regarding  the  dependence  of  p(a,b,t; 
m,n)  on  0  and  a  has  been  established. 

Equations  3  and  4  correspond,  respectively,  to  Eqs.  1.7  and  1.7'  in  Doob.12 
SL.ch  equations  are  frequently  distinguished  by  the  adjectives  “backward”  and 
forward.”  Equation  3  is  the  backward  one  for  this  process.  For  a  further 
discussion  of  these  two  equations,  consult  Doob,13  or  Feller.14  Note  also  that 
an  analogue  of  Eq.  3  appears  in  Morse  and  Kimball.15 

It  is  now  perhaps  appropriate  to  summarize  what  has  developed  so  far. 

A  certain  stationary  Markov  stochastic  process  is  under  consideration. 

The  process  is  characterized  by  the  initial  state,  and  the  two  functions,  <t>  and 
a.  The  probability  that  the  system  remains  in  a  given  state  (m,n)  throughout  a 
time  interval  of  length  t  is  e~t^m>n\  Alternatively,  it  is  c  •Vrtm.n),  Where 
t(ih  n)  is  the  expected  duration  of  the  system  in  the  state  (m,n),  and  r(m,n)  = 
l/0(ffl,n).  The  probability  that,  on  leaving  the  state  (m,n),  the  system  goes  to 
the  state  (m,n-l)  is  Qf(m,w);  the  probability  that,  on  leaving  the  state  (m,n),  it  goes 
to  the  state  (m-  l,n)  is  1  -a(m,n),  which  is  also  called  0(m,n). 

Finally,  if  p (a, b,  t ;  m,n )  denotes  the  probability  that  the  system,  starting  at 
state  (m,n),  is  in  state  ( a,b )  after  an  elapse  of  time  t ,  then  p  must  satisfy  both 
Eqs.  3  and  4. 

The  fundamental  problem,  which  will  be  solved  completely  in  the  next  sec¬ 
tion  can  now  be  stated: 

Given  the  two  functions  0  and  a,  write  out  explicitly})  (a,b,t;  m,  n). 
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EXACT  DETERMINATION  OF  THE  PROBABILITIES  p(a,b,tj  m,n) 


In  this  section  p (a ,b , t ;  m,n)  will  be  determined  exactly.  However,  it  will 
be  seen  that  the  result,  unless  a  is  close  to  m,  and  b  is  close  to  n,  is  of  little 
immediate  practical  interest  in  the  general  case. 

An  obvious  possible  way  to  determine  p(a,b,t;  m,n)  is  to  solve  the  partial 
differential-difference  equations,  Eqs.  3  and  4  of  the  preceding  section.  It  is 
the  author’s  experience,  however,  that  this  method  does  not  lead-in  any  evident 
way— to  a  form  of  the  solution  that  is  convenient  for  the  applications  that  have 
given  rise  to  this  research.  Consequently  the  problem  will  be  attacked  by  meth¬ 
ods  from  the  general  theory  of  probability. 

To  findp(a,b,t;  m,n)  is  to  answer  the  question,  “What  is  the  probability 
that,  starting  at  (m,n),  the  system  will  be  at  (a,b)  after  an  elapsed  time  t  ?” 

For  the  system  to  start  at  (m,n)  and  later  be  at  (a,b),  it  must  proceedalong 
one  of  the  paths  (as  in  Fig.  2)  that  join  (m,n)  to  (a, b).  It  may  move  along  any  such 
path,  and  motions  along  different  paths  are  mutually  exclusive  events. 

It  is  easy  to  compute  the  probability  that  the  system  will  move  along  a  given 
path.  For  example,  suppose  the  system,  initially  at  (5,7)  proceeds  through  (5,6), 

(5.5) ,  (4,5),  (4,4),  (4,3),  and  (3,3)  to  (3,2).  The  probability  of  going  from  (5,7)  to 

(5.6)  is  a(5,7);  from  (5,6)  to  (5,5)  is  a(5,6);  from  (5,5)  to  (4,5)  is  (3(5,5);  etc. 

Hence  the  probability  of  moving  along  this  particular  path  is  a(5,7)  •  a(5,6)  • 
0(5,5)  •  a(4,5)  •  a(4,4)  .  (3(4,3)  •  a(3,3). 

Similarly,  if  there  is  a  quite  arbitrary  path  from  (m,n)  to  (a,b),  there  would 
be  a  certain  a  for  each  downward  step  and  a  certain  (3  for  each  leftward  step. 

The  product  of  these  a’s  and  (3’s  would  give  the  probability  of  moving  along  this 
particular  path. 

In  order  to  present  these  ideas  as  clearly  as  possible  and  to  facilitate  cer¬ 
tain  future  discussions  it  seems  desirable  to  introduce  the  following  method  for 
describing  paths: 

Consider  a  path  from  (m,n)  to  (a,b).  Each  transition  consists  of  a  motion 
either  one  step  to  the  left  (the  first  coordinate  decreases  by  one  unit)  or  one 
step  down  (the  second  coordinate  decreases  by  one  unit).  A  convenient  way  to 
describe  the  path  is  to  write  down  a  sequence  of  m  -  a  zeros  and  n  -  b  ones,  a 
zero  corresponding  to  a  motion  to  the  left,  and  a  one  corresponding  to  a  motion 
down.  In  this  system,  the  path  described  a  little  earlier,  beginning  at  (5,7)  and 
ending  at  (3,2),  would  be  written  1101101. 

To  formalize  this  procedure  in  the  general  case,  proceed  as  follows:  Let 
x  =  m  -  a  and  lety  =  n  -  b.  Consider  the  set,  lxy  ,  of  all  positive  integers,  which 
in  their  usual  binary  representation  contain  exactly  y  ones  and  no  more  than  x 
zeros.  If,  in  the  usual  representation,  the  integer  contains  y  ones  andx-  c  zeros, 
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let  c  zeros  precede  the  usual  representation.  This  will  be  called  the  “modified 
binary  representation.”  For  instance,  if  x  =4  and  y  =2,  the  integer  6  usually 
represented  by  110  would  be  written  000110  in  the  modified  representation.  If 
k  is  any  integer  in  Ix  y,  and  if  the  modified  representation  of  fc  is  ij  i2  .  •  .  ix+y  , 
define  6k  ,  as  i  • .  Thus,  if  x  =  1  and  y  =  2,  one  would  have  55i  -553  =  1  and  %2_0» 
since  5  =  101. 

Now  there  is,  in  the  fashion  previously  described,  a  one-to-one  corres¬ 
pondence  between  the  integers  in  the  set  I  and  the  paths  connecting  (m,n)  to 
(a,b).  Let  nk  denote  the  path  that  corresponds  to  the  integer  fc  in  \x  .  Then 
5k>j  =0  means  that  the  jth  transition  for  nk  is  to  the  left,  and  5kj  =  l’  means  that 
the  jth  transition  is  downward. 

Notice  that  if  6A1  =  0,  the  second  state  is  (m  -  l,n);  if  &A1  =  1,  the  second 
state  is  (m,  tt  -  1).  In  either  case,  the  second  state  is  (m  -  1  +  6tl,n  -  641).  In 
general,  as  can  easily  be  seen,  the  system  will  be  in  state 

(m-' +  i  vn~£  v 

;*  i  H 

after  r  transitions,  provided  that  r  ^  x  +  y. 

In  this  way  the  notation  that  has  just  been  introduced  enables  one  to  talk 
freely,  but  specifically,  about  the  motion  of  the  system. 

An  example  of  the  use  of  this  notation  is  provided  in  writing  down  the  prob¬ 
ability,  say  pj.(a,b;  m,n),  that  the  system  proceeds  from  the  state  (m,n)  to  the 
state  (a,b)  along  the  path  nk .  It  is  known  from  the  earlier  discussion  that  pA( a,b; 
m.n)  is  a  product  of  certain  a’s  and  fi’ s,  which  are  completely  determined  by  the 
path,  but  in  ordinary  terms  it  is  hard  to  state  just  which  a’s  and  j3’s  are  to  be  used. 

Consider  the  path  rrk.  Define  ykl  as  a(m,n)  if  6kl  =  1,  and  as  j3(  m,n)  if  64l  =  0. 
In  general,  define  r  +  i ,  for  r  =  0,  1*,  .  .  .,  x  +  y  -  1  as 

f  r 

<x(m  -r  +  2]  Skj  ,  n  -  £  **/>  Si,r+ 1  -  1 . 

hi  h  i 

and  as 

r  r 

P(m-r  +  2]  Shj’n~  H  Ski)i{  5*,r+l  =0- 

/•  i  hi 

Specifically, 

y*.r+l  -  Sk,r>l  a(m-r  +  &kj,  n  -  £  Skj)  +  (1  -  5i>r+1)  f3(m-r+  £  Skj,  n  -  £  Skj). 

h  i  /-i  hi  /'■! 


Now, 

x+y 

pk{a,bi  m,n)  -  yi(/. 

A  summary  of  the  results  up  to  this  point  of  the  discussion  might  indicate 
that  a  notation  has  been  introduced  that  facilitates  writing  down  the  probability 
that  the  system,  starting  at  (m,n),  travels  a  certain  path  to  (a,b). 

Of  course  it  is  necessary  to  know  more,  e.g.,  the  probability,  say  Gk(afb,t; 
m,n),  that,  if  the  system  travels  the  path  irk  from  (m,n)  to  (a,b),  it  will  be  at 
(a,b)  at  a  certain  time  t.  If  G^(a,b,t;  m,n)  were  known,  p(a,b,t;  m,n)  would  also 
be  known,  since 

p(a,b,t ;  m,n)  =  £  pk(a,b-,  m,n)  Gk(a,b,t\  m,n). 
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Perhaps  the  simplest  way  to  find  Gk(a,  b,  t;  m,n)  is  to  use  the  variates  Tc  d 
discussed  in  the  preceding  section,  dealing  with  the  statement  of  the  problem. 

If  it  is  known  that  the  system  moves  along  the  path  irk  ,  then  it  will  be  at  (a,b) 
at  time  t  if  and  only  if  Tm  n  +  .  .  .  +  Tab  >  t ,  but  the  sum  of  all  these  variates 
except  the  last  is  <t.  The  variates  appearing  in  the  two  sums  are  to  be  those 
arising  along  the  particular  path  nk  considered. 

It  is  possible  to  be  more  specific  about  the  variates  present  in  these  sums 
if  the  previous  notation  for  the  description  of  the  path  is  used. 

The  state  in  which  the  system  will  be  found  after  r  transitions  as  it  pro¬ 
ceeds  along  7 T/(  was  written  out  earlier.  If  this  state  is  denoted  by  [m(fe  ,r),n(fe,r)], 
the  earlier  result  was  that 

r 

m(/c,r)  *•  m  —r  +  £  8t/. 

/*! 

and 

r 

n(k,r)  *=  n  -  £  8.  .  . 

i- 1 


The  variates  that  must  be  considered  on  nk  are  T^^  „(*  r),  for  r  =  0,  1, 

.  .  .,x  +y.  The  probability  Gk(a, b,t;  m,n),  then,  is  the  probability  of  the  compound 
event 

xty-1  xty 

E  Tm(k,r),  n(k.r)  <  ‘  and  E  Tmik,r),  n(k,r)  >  1  • 
r=0  r»0 


The  probabilities  that  enter  into  the  computation  of  the  probability  of  this 
compound  event  can  be  found  quite  directly  by  the  ordinary  method  of  charac¬ 
teristic  functions. 

Let  fcJ  denote  the  characteristic  function  for  the  variate  TcJ ;  then 


fed  (u) 


f  eius  <(>  (c,J)  e-”f>G,d)  ds  m 

J  -  iu 

0 


1 

1  -  iu  r(c,d) 


Now  the  variates  T  ,  .  .  .,  Tab  are  independent,  since  the  process  is  a 
stationary  Markov  process.  Consequently  the  characteristic  function  for  the 
sum 

x+y-1 

E  ^m(*,r),  n(k,r ) 
rmO 


is  the  product 

xty-1 

^  fm(k,r),  n(k,r)  ’ 
rm  0 

and  the  density  function,  gk(s),for  this  sum  is  determined  by  the  equation 


**y- l  X 

r, 0  1  -  «*  r[tr(k,r),  d,k,r)} 


du. 


The  probability  that  Tab  >  t  -  s  is  Since 


Gk{a,b,t;m,n)  ■  J  gk  (s)  <f>(a,b)  ^ 

0 
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it  is  found  that 


Gk(a,b,f,  m,n) 


e  -(*-«)  <f>(a,b ) 


*+y-l 

11 

r»0 


1 

1  -  iu  r[m(£,r),  n(&,r)] 


1 

du  '  ds. 


This  expression  can  be  simplified  in  two  ways.  First,  for  neater  notation, 
write  mkr  and  n y  instead  of  m(k,r)  and  n(k,r).  Second,  interchange  the  order  of 
integration,  integrating  first  with  respect  to  s,  and  then  with  respect  to  u.  The 
integral  involving  s  can  be  evaluated  easily;  it  is  found  that 


1  r  x+y~1  1 

G,(a,b,t\m,n)  =  - —  [I  - - - - 

J  r» 0  1  —  iu  r(ji,  ,« 


-iut 


hr,nhr)  <f>(a,b)-iu 


du . 


To  complete  the  explicit  determ: nation  of  p(a,b,t;  m,n),  merely  combine 
this  result  with  the  earlier  result  coi  cerning  p^(a,b;  m,n).  The  combination  of 
the  two  expressions  is  rendered  more  compact  by  writing 

x+y  x+y-l 

,?o 


The  final  result  is  that  p(a,b,t ;  m,n)  is  given  by 

1  ^  7  x^-X  y*>r+i  e~iut  b) 

—  )  i  n  -  - du . 

2"  kd  J  r«0  1~iur^mkr>nkr/  <t>(a,b)  -  iu 
x ,  y  “oo 

The  validity  of  the  criticism  made  against  this  result  at  the  beginning  of 
this  section  should  now  be  evident.  If  x  +  y  is  at  all  large,  the  integrand  can  be 
[for  general  r  (mAr,ntr)]  quite  complicated,  and  it  is  difficult  to  apprehend  the 
nature  of  the  solution. 
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THE  PROBABILITY  OF  WINNING 


In  the  introduction  to  this  memorandum  it  was  pointed  out  that  the  problem 
under  investigation  arose  from  a  definite  problem  in  the  theory  of  combat.  The 
theoretical  development  in  the  previous  sections  ignored  completely  any  concrete 
interpretation  of  the  results;  the  work  was  purely  mathematical  in  character. 

This  section  is  devoted  to  the  study  of  a  problem  arising  from  the  combat 
interpretation  of  the  stochastic  process  described  in  the  first  two  sections.  If 
the  stochastic  process  is  considered  as  describing  the  interaction  of  two  groups 
of  combatants,  then  one  question  of  quite  great  interest  arises:  “What  is  the 
probability  that  a  given  side  will  win?” 

Now,  of  course,  the  concept  “winning”  must  be  clarified.  A  simple  defi¬ 
nition  (and  the  one  that  will  be  used  here)  is  that  winning  means  the  complete 
annihilation  of  the  opposing  side.  The  symbol  P(m,n)  will  denote  the  probability 
that  the  first  side  wins,  if  there  are  initially  m  combatants  on  the  first  side,  and 
n  combatants  on  the  second  side. 

Relevant  notation  from  the  previous  sections  will  be  preserved  here.  In 
particular,  a(m,n)  will  denote  the  probability  that,  if  there  are  m  combatants  on 
the  first  side  and  n  combatants  on  the  second  side,  then  the  next  casualty  to  oc¬ 
cur  will  be  on  the  second  side,  and  j3(m,n)  will  be  l-a(m,n).  More  briefly,  a(m,n) 
is  the  probability  of  having  the  system  proceed  from  (m,n)  to (  m,n-  1),  and  /3 (m,n) 
is  the  probability  of  having  the  system  proceed  from  (m,n)  to  (  m  -  l,n). 

If  the  system  is  in  state  ( in, n) ,  there  are  two  mutually  exclusive  ways  in 
which  the  first  side  can  win.  The  system  might  proceed  to  the  state  (m,n-  1), 
with  the  first  side  then  winning.  The  probability  of  this  is  a(m,n)  P(:n,n-  1).  Or, 
the  system  might  proceed  to  the  state  ( m-  l,n)  with  the  first  side  then  winning. 
The  probability  of  this  is  /3 (m,n)  P  (m  -  l,n).  Since  these  are  mutually  exclusive 
events,  the  probability  that  the  first  side  wins  from  the  state  (m,n)  is  the  sum 
of  these  two  probabilities.  The  following  difference  equation  is  therefore 
obtained. 

P{m,n )  =  a (m,n)  P  (m,n  -  1)  +  /3(m,n)  P  (m  -  l,n).  (5) 

Equation  5  is  the  fundamental  equation  for  the  work  that  follows.  It  is  of 
some  interest  to  observe  that  it  is  a  consequence  of  Eq.  3.  It  may  be  obtained 
from  Eq.  3  by  setting  b  =0,  summing  for  a  =1, 2,  .  .  .,  m,  and  letting  t  become 
infinite.  The  partial  derivatives  on  the  left  all  approach  zero  as  t  becomes  in¬ 
finite.  Equation  21,  which  appears  later,  is  similarly  related  to  Eq.  4. 

A  brief  reflection  will  show  that  if  P(  w,0)  is  known  for  m  =  1,  2  ,  .  .  .,  and 
if  P(0,n)  is  known  for  n  =  l,2,  .  .  .,  then  Eq.  5  completely  determines  P(m,n)  at 
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all  lattice  points  in  the  first  quadrant  (i.e.,  at  all  points  having  integral  coordi¬ 
nates).  On  the  other  hand,  from  the  definition  of  P(m,n),  it  is  obvious  that 


P(m,0)  =  1  for  m  =  1,2,... 
P( 0,  n)  =  0  for  n  =  1 ,  2 . 


(6) 


These  equations  will  occasionally  be  referred  to  as  the  “boundary  conditions.’’ 

In  principle  P(m,n)  could  now  be  written  out  explicitly  by  solving  Eq.  5.  If 
the  coefficients  a(m,n)  are  not  too  complicated,  this  is,  in  fact,  possible.  For 
example,  suppose  that  a(m,n)  is  a  constant,  say  a.  Then 


m‘1  (n  +  k -1)1  . 

P(n»)~  E 

k* o  (n-1)1  A! 


(7) 


Or  suppose  that 


a  (m,n) 


am 


am  +  /3b 

where  a  and  j3  are  constants.  Then  it  can  be  shown  that 

k  km*n  HfaVg)  + 11 


PM  =  ft 


E  (-D” 

ku  1 


(m-k)\k\  r[n  +  W/^)  +  1] 


(8) 


The  simplest  way  of  verifying  the  truth  of  these  two  assertions  is  to  observe 
that  the  given  function  satisfies  both  Eqs.  5  and  6,  and  then  appeal  to  the  obvious 
uniqueness  of  the  solution. 

Now  if  m  and  n  are  at  all  large,  both  Eqs. 7  and  8  offer  difficulties  from  the 
point  of  view  of  computation  as  well  as  fro;.i  the  point  of  view  of  simple  under¬ 
standing  of  the  nature  of  the  solution.  Indeed  Eq.  8  is  especially  bad,  since  it 
involves  the  differences  of  numbers  that  may  be  quite  large. 

For  these  two  reasons  the  project  of  obtaining  the  exact  solution  of  Eq.  5 
will  be  abandoned.  Rather  the  problem  to  be  considered  here  is  that  of  finding 
a  simple  approximation  to  P(m,n),  which  will  be  valid  if  m+  n  is  large. 

It  should  be  intuitively  evident  that  without  some  restriction  on  the  behavior 
of  the  coefficients  a(m,n),  general  results  on  the  asymptotic  behavior  of  P(m,n) 
cannot  be  obtained.  Fortunately  the  very  nature  of  the  combat  situations  being 
investigated  here  imposes  just  such  a  restriction,  which  will  now  be  stated. 

Consider  the  two  numbers  a(m,n)  and  a(m+  l,n).  These  represent  the  prob¬ 
abilities  of  a  casualty  on  the  second  side  when  the  first  side  has  strengths  of 
itiorin  +  1.  It  is  certainly  reasonable  to  assume  that  the  stronger  the  first  side, 
the  greater  the  probability  of  a  casualty  on  the  second  side;  i.e.,  to  assume  that 
a(m+  l,n)  >a(ni,n).  A  similar  discussion  shows  the  reasonableness  of  assuming 
that  a(m,n)  >a(m,n-  1). 

What  has  just  been  found  to  be  reasonable  will  now  be  made  into  a  formal 
assumption: 

Throughout  the  rest  of  this  section  it  will  be  assumed  that 

a(m  +  l,n)  >  <x(m,n)  ^ 

a(m,n)  >  a(m,n  -  1). 

It  may  be  noted  that  all  the  particular  forms  of  a(m,n)  that  the  author  has 
seen  in  the  literature  on  Lanchester’s  theory  satisfy  these  assumptions. 
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The  process  for  finding  an  approximation  to  P(m,n)  is  based  on  the  obser¬ 
vation  that  the  value  of  P  at  any  point  on  the  straight  line  whose  equation  is 
m  +  n  =  N  is  a  linear  combination  of  the  va’ues  of  P  at  certain  points  on  the  straight 
line  whose  equation  is  m  =  N  -  1.  As  the  problem  now  stands  a(m,n)  is  defined 
only  if  m  and  n  are  not  both  zero  and  neither  is  negative.  To  facilitate  the  lan¬ 
guage  in  the  following  discussion  it  is  desirable  to  extend  the  domain  of  definition 
of  a.  From  now  on  it  will  be  assumed  that  a(m,n)  is  defined  for  all  m,n  (positive, 
negative,  or  zero)  for  which  iti  +  n  5: 1.  For  negative  values  of  the  arguments  the 
following  conventions  are  to  hold: 

a(m,n)  =a(m/0)  icrn<0; 

a(m,n)  =  a(0,n)  for  w<0; 

0(m,n)  =  1 -a(  m,n)  for  all  m,n. 

It  might  be  noted  that  this  convention  regarding  values  of  a(m,n)  in  the  fourth 
and  second  quadrants  preserves  the  fundamental  assumption  of  Eq.  9.  It  might 
also  be  noted  that  this  extension  of  a(m.n)  to  the  fourth  and  second  quadrants  is 
not  essential  to  the  arguments  that  follow;  rather  it  simply  serves  to  eliminate 
certain  unimportant  details  that  would  otherwise  have  to  be  discussed. 

It  is  now  instructive  to  consider  the  values  of  ooth  P(m,n)  and  a(m,n)  on  the 
straight  line  for  which  m+  n  =  k.  (Although  the  functions  are  defined  only  for  the 
lattice  points,  values  can  be  interpolated  for  both  functions  so  that  the  following 
observations  are  valid.) 

First,  a(m,k  -  m)  is  a  monotone  nondecreasing  function  of  m,  and  0<a(m,k-m) 

<  1. 

Second,  P(m,fc-m)  is  a  distribution  function;  i.e., 

i)  P(m,fe-m)  is  a  nondecreasing  function  of  m 

ii)  lim  P(m,fc-m)  =  0 

m-*— oo 

iii  )lim  P(m,k  -  m)  =  1. 

m-»  oo 

This  suggests  the  possibility  of  trying  to  approximate  P(m,n)  by  a  standard  dis¬ 
tribution  function— for  instance,  the  normal  distribution  function.  For  each 
value  of  k  and  m  there  certainly  exists  a  number  \\'k  (m)  such  that 

1  wkM 

P(m,k—m )  =7=  f  e“^‘  dt ; 

-00 

however,  the  precise  form  of  W4(m)  is  irrelevant.  It  must  be  remembered  that 
only  an  asymptotic  formula  is  being  sought. 

Consider  what  would  happen  if  a(m,n)  were  a  constant,  a.  The  classical 
theorem  of  De  Moivre  and  Laplace  regarding  the  approximation  of  the  Bernoulli 
distribution  by  the  normal  distribution  would  show  that 


am-bn 


for  a  proper  choice  of  a  and  b,  is  asymptotically  equivalent  to  P(m,n). 

It  is  certainly  reasonable  to  ask  whether  a  similar  approximation  might 
not  also  hold  in  the  prcblem  considered  here  for  general  coefficients  a(m,n). 
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At  any  rate  it  suggests  the  plausibility  of  the  following  change  of  variables,  and 
that,  in  fact,  is  all  that  the  preceding  discussion  was  aimed  at.  To  repeat:  The 
following  change  of  variables  is  not  capricious;  it  is  suggested  bv  standard 
classical  theorems. 

Let 

A'  =  m+n  (10) 

u  =  (am  -  br)/l\'rr  +  n)  , 

where  both  a  and  b  are  positive  numbers  to  be  determined  later.  Because  a 
and  b  are  positive,  the  transformation  from  (m,n)  to  (u,N)  is  nonsingular.  The 
inverse  transformation  is 

m  =  (bN  +  u  \/iV)/(a  +  b) 
n  =  (a/V  -  u  \/7V)/(a  +  b) . 


Then 


P(m,n)  =  Pmtn  [(am  -  bn)/(\Jm  +  n)]  =  PN  (u) 
a(m,n)  =  ar/j+n  [(am  -  bn)/{\Jm  +  n)]  =  a^(u) 
fiN(u)  =1-0  N(u). 

P(m,n  -  1)  =  Pm+n_j  [(am  -  bn  +  b)/(yjm  +  n  -  1)1  =  j  [u  /(.V  -  1)  +  b/ y'(/V  -  1)] 


P(m  -  l,n)  =  PCT+(|_j  [(am  -  -  a)/(\/m  +  n  -  1)]  =  PN  { 


The  fundamental  difference  equation  (Eq.  5)  now  becomes 
PN(u)  =  0^(a)  fy.jU  V/V/(/V-l)  +  b/sJW^T)}  +  PN(u)  PN_ J  [u  yW/(/V  -  1)  -  a/VUV^T)] .  (13) 

This  suggests  the  following  procedure:  Consider  the  set  of  all  functions  of 
one  real  variable.  For  N  >2,  let  the  linear  transformation  LN  be  defined  by 

LNf(u)  -0Ln  (u)  f  [a  y/tfAN-  1)  +  b/y/Um.))  +  /3*  (a)  /  [a  >/W  -  1)  -  . 

For  example,  iff(u)  =  u,  and  aN(u)=a{a  constant),  then 


Since 


LNf(u)  =  u  y/N/W  - 1)  +  (a  6  -/?a)/v/Um). 
PN(u)  ^LffPff^u), 

PN(u)  =  Lyy  Ln_  j  •  •  .  L2  P,(u), 


where  Pj(u)  is  either  zero  or  one,  according  to  the  value  of  u. 

Let{jtyy}  be  an  arbitrary  sequence  of  functions,  and  let  hN  -  PN  -  .  (In 

the  sequel,  will  be  the  approximation  to  PN ,  so  that  hN  measures  the  differ¬ 
ence  between  the  exact  solution  of  Eq.  14  and  its  approximation.)  Let  k  be  an 
arbitrary  positive  integer.  From 

hk=Pk~  &k  and  ^*+1  =  Lk+ 1  Pk  ' 
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it  follows  that 

Vl  hk  =  Pk*l  ~  Vl  gk  ; 

and,  since  PM  =  hk+1  +  0i+]  ,  it  follows  that 

l>k*  i  =  Vi  hk  -  Vi  ~Vi  sa¬ 


lience 


hk+ 2  =  V 2  Vl  -  0?*+2  -  Vj  **♦  1 ) . 


Ai+2  =^+2  Vl  W2~V2  KM)-Lm  (gk+1-LM  gk). 

Repetition  of  this  procedure  gives  rise  to  the  following  result: 


where 


hX  =  LN  LN- 1  •  •  •  V  1  “  £  'V* 

M+l 


RN~sN-L n  gN_Y 

and,  for  fe  +  1  <  j  <  n 

W  ■  •  •  •  Vi  kj-LjSj. i>- 

funcUonf!  ^  ^  P0S“lVe  fUnCti0nS'  with  sum  >'  follows  that,  tor  any 

'  I  ~-„<ual,<w  '  9,-1  (»)  I  for  all  j  =  2,  3,  .  ,  .  .  Consequently,  if 


converges,  then 


£  ’“k  I  Si  («)  -  Lj  g.  ,  (u) 

/=2  "  <'u< 00  /  /  1 


2  *, 


ofanhobwe  large6  Nr"  ayaSy  SmaU  SimPly  ^  ***  *  SUfflCiea,1>’  lar^'  ^ardless 

Under  such  circumstances,  it  would  follow  that  thp  qomionoo  Jl  i 
to  zero  [i.e.,  gA,  (u)  is  close  to  P A(u)  for  large  N]  provided  that  L  L  *  C°51Ver^es 
converges  to  zero  as  N  becomes  infinite.  *  L'v-]  ’ 1  ’  L*+l 

The  preceding  discussion  may  be  summarized  under  tv-o  principles. 

First  Principle 

that  S  Seilki"  |  a  r6T  u f 10  aPPr°Xlmale  ‘T  *»  satisfy  the  condition 
A  -oo<u<M  1  fl/ (u)  ~  L/  0/-1 (u)  I  converges. 

Second  Principle 

The  sequence  {P*  -  fl/v}  will  converge  to  zero  if  the  derived  sequence 

{LN  Vl  •  •  •  Vl  (/A  ~  g*)l  (k  fixed) 
converges  to  zero  as  N  becomes  infinite. 
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it  follows  that 


Lk,i  h  =  pk+i  -  gk 


and,  since  Pj.+1  -  hk+l  +  SA+1  }  it  follows  that 

hk+l  =Lk+\  hk  ~(S&+i  ~Lk+1  gk). 


Hence 


or 


k+2  ^i+2  ^k+l  ~  ^i+2~  Lk+2  Sk+0> 


***J  "£'**S  L‘"‘  **"<8**2"£'‘.2  «•!>-  ‘MliW-lw  5i). 
Repetition  of  this  procedure  gives  rise  to  the  following  result: 


where 

and,  for  k  +  1  <  j  <  n 


h"=lNlN-l  + 

/=*+l  ' 

=&N  ~  &N-i 

Ri‘LN-'~'Lhl  ig i-Ljgj.y). 


function?*  a"d  ^  “*  posmve  functlons.  with  sum  one,  it  follows  that,  for  any 


l/WI- 


80  1  *i 1  S -it. '«/<“»  -Lf  8,-1  <“>  I  for  all  ,  =  2,  3 . Consequently,  if 


converges,  then 


£  lub  [  g.  {„)  -  l  g  („) 
j9 2  ^  /  &;-i 


n 

E  *, 

/=*+! 


o^how  ££  n"7  Sma11  Simply  by  *“"«  ‘  sufficiently  large,  regardless 

to ^  -quence  {h„}  converges 
converges  to  aero  as  N  becomes  WinTte  1  P  ‘d<id  ‘hat  L"  L*-l-  4,1 
The  preceding  discussion  may  be  summarized  under  two  principles. 

First  Principle 

that  f  iub  q.  ZT7  {*)  ! no  appr0Xlmate  ^  P/v)  try  to  satisfy  the  condition 
fc  2  -oo  <u<00 1  Hi w  L;  fly-1  t«l  I  converges. 

Second  Principle 

The  sequence  {P*  -  will  converge  to  zero  if  the  derived  sequence 

iLN  LN-\  •••  Lk+\  </*  ~«A)i  (*  fixed) 

converges  to  zero  as  N  becomes  infinite. 
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As  noted  earlier  the  conditions  expressed  in  Eq.  9  imply  that  aJu)  shall 

be  LXdeCmoarpngf  Cti?n  °f  l  ^  the  tW°  principles  ^st  «ted  can 
be  applied  more  information  about  the  asymptotic  behavior  of  aN(u)  is  required 

Throughout  the  rest  of  this  section  it  will  be  assumed  that  required. 


«JV  <“>  =  P  +  (cu/^/W)  +  A  (u,N)/N 
ftN(u)  =  9  -(cu/\/77)  -  A  (u,N)/N 


(14) 


r„hv  rff„L’’,andrfC  T  C0,nstants'  with  P  *  «  =  1,  and  A(w,N)  t,  bended,  for  u  in 
h  ’  "‘l  j;!oaed  lnterval-  It  might  be  remarked  that  for  all  the  types  of  com- 

bat  with  which  the  author  is  familiar,  Eq.  14  is  satisfied. 

in  nr<trPre^ieWf  °f  Uf  P,r°,CeSS  that  wil1  Presently  be  carried  through  is  perhaps 

fie^  Fn  i/irSt’  ,afsinfle  function  8  of  one  real  variable  will  be  found  that  satts- 

.  U.  except  f°r  terms  of  °rder  l/N^  .  Specifically,  it  is  proposed  to 
find  a  function  g  such  that  F  u 

«<«)  -  V“)  S  1“  vOT -  1)  +  [ b/y/W  -  1)11  -  (3n(u)  g  1„  #7(/V  -  1)  -  ta/vT/V“-l)]l  =  o  <l//v3/2) 

Thisjunction,  however,  will  not  quite  satisfy  the  boundary  conditions  demanded 

j  Q,HSfeC?"d’ a  seJuence  of  functions {fl/v(u)}  will  be  constructed  fromfi(u)  that 
do  satisfy  the  boundary  conditions  demanded  by  Eq.  6  and  that  still  satisfy  Eq 
13,  except  for  terms  of  the  order  of  l/N3/2  .  Such  a  sequence  will  satisfy  the 
conditions  enunciated  in  the  first  principle.  y 

Third,  it  will  then  be  shown  that  Eqs.  9  and  14  imply  that  the  conditions 
enunciated  in  the  second  principle  are  satisfied.  conditions 

First  Part 

(sincea+iS^l70Sed  ^  *  £UnCti°n  0  8atisfying  E*  15>  which  can  be  written 

°‘/V(U)  ^  V'N/iN  ~  l)  +  lb/J(N  -  1)11  -  «<“>)  +  PN(u)  (*l“  _  ta/v^vTI)]!  -g{u))=0  (l//>/3/2). 

Apply  Taylor’s  theorem  to  the  two  differences  that  appear  in  this  equation  and 

olT’TTr^u.rifaVtoZi“VOlVln8  mUt  1/N' ign0rlnB  lh0Se  °f 

(l/yfVIlpS-,.)*'.  1/2 N  ltl.2(..Mc]1<'.(pjS,,a!)s.|.0(1/(va/2) 

th„  “  ' be  recaUSd  that  when  the  variable  u  was  Introduced  it  wan  stated 

he  fw  *  Jim"  ‘°  deterained  Iater-  At  this  point  it  is  possible  to  impose 

‘hat  the  'erm 

pb-qa-  0;  (Jg) 

consequently 


■ p * 


b*=qk, 

when  fe  is  yet  to  be  determined.  It  should  be  borne  in  mind  that  Eq.  16  is  in 
act,  an  implicit  equation,  since  the  choice  of  a  and  b  affects  the  values  of  p  and 
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q  in  general.  The  specific  example  considered  below  will  make  this  clear  if  it 
is  not  obvious  already. 

With  a=  pfc  and  b  =  qk ,  the  term  involving  1/N  can  be  simplified;  for  pb2  + 
qa2  =(pq2+  qp2)  k2  =  pqk2.  Then  the  requirement  that  the  term  in  1/N  must  also 
vanish  leads  to  the  following  differential  equation. 

(1  +  2  (a  +  b)  ej  ug'(u)  +  pqk 2  g"  (u)  «  0  ,  (17) 

the  solution  of  which  is 

/  exp|-[l  T  2  (o  +  L)c]i2/(2pqk2)\  dt  (18) 

where  Cj  and  c2  are  arbitrary  constants. 

It  should  now  be  observed  that  the  product  (a  +  b)c  =  fee  is  invariant,  since 
c  involves  h  only  through  a  factor  of  1  At .  This  observation  suggests  the  final 
determination  for  k  .  Simply  choose  k  as  V1  +  2  (a  +  b)c/p«j;  the  coefficient  of 
-t2/2  in  the  exponent  of  Eq.  18  now  becomes  unity.  Thus,  with  these  choices 
for  a  and  b ,  it  is  seen  that 

g(u)  =  C|  /  exp  (-  i2/2)  dt.  (19) 

c2 


Before  proceeding  further  with  the  discussion  of  the  general  case  it  seems 
desirable  to  consider  a  specific  example— one  that  arises  from  the  classical 
Lanchester  theory.  Suppose  that 

a(m,n)  =<Xm/(<Xm  +  fin),  and  f)(m,n)  =  /3 n/(<Xm  +  /Bn),  (20) 

where  a  and  /3  now  denote  two  positive  constants.  With  N  =  m  +  n  and  u  -  (am  -  bn)/ 
V m  +  n ) ,  it  is  found  that 

ayy(u)  =  p  +  c(u/y/N)  +  k(u,N)/N 
and 

PN(u)  =  q-c(u/y/W)  -  k(u,N)/N, 

in  which 

p  =  ab/(ab  +  fid) , 
q  =  fia/(o.b+  /3a), 

and 

c  =  a/3  (a  +  b)/{ab  4  /3a)2  . 


These  expressions  are  found  by  substituting  Eq.  11  in  arm/(am  +  jSn).  and 
expanding  in  powers  of  1/M.  Equation  16  now  becomes 

a&2-  /3a2  =  0. 


With  a  proper  choice  of  signs  it  is  seen  that  a  =k '/a,  b  =  k'/^, where  {;'  is  a 
positive  constant.  This  k'  is  not  the  same  as  the  kin  Eq.  17. 

With  these  values  for  a  and  b  it  is  noted  that  p  =/»/(/»+ //3),  q  =  //3/(V^r + 
ffi),  and  (o  +  j)  c  =  1.  Now  with  a=  k [/a/(/a+ /^)],  and  b=k  [Sp/(fa +  '/&)],  Eq. 
17  becomes 

3 ug’  (u)  +  [v5^7(Vcr  +  V/5)]2 12  g"  (u)  =  o. 
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The  proper  choice  for  k  is,  by  the  general  theory,  /[!  +  2(q  +  b)c  ]/pq  ,  which  here 
becomes  Then  u  =/ 3/v'a/T[(\/a'iti  -  V/3 n  )/Vm  +  ii],  and  Eq.  19  holds,  i.e., 

U 

g(u)  =  Cj  /  exp(-i2/2)  dt. 

C2 

Second  Part 

There  now  is  left,  in  the  general  theory,  the  problem  of  choosing  c1  and  c2. 
There  are  actually  many  possible  choices  for  q  andc2,  although  they  are  all 
asymptotically  equivalent.  Choose,  for  example,  Cj  as  l/VSTF  and  c2  as  -  °°.  It 
then  turns  out  thatg(u)  (now  independent  of  N)  satisfies  the  boundary  condition 
along  the  coordinate  axes  only  asymptotically.  It  is  easy,  however,  by  letting 
c1  and  c2  depend  on  N,  to  obtain  a  sequence  of  functions  {fl/y(u)}  that  actually  sat¬ 
isfies  the  conditions  that,  if  n  =  0,  the  value  of  s^(u)  is  1,  and,  if  m  =  0,  the  value 
of  g/y(u)  is  0. 

To  see  how  this  may  be  done  consider  the  situation  that  occurs  on  the  line 
m  +  n  =  N.  At  the  point  (0,N)  on  this  line  (where  P(0,N)  =  0)  u  has  the  value  -bVN". 
Since  naturally  q  is  not  to  be  0,  g(u)  will  be  0  only  if  c2  =-b>/N\  Along  this  line 
m  +  n  =  N,  u  increases  steadily  until  the  point  (N,0)  is  reached,  at  which 
P(  N,0)  =  1.  At  (N  ,0) ,  u  has  the  value  a  M.  In  order  to  obtain  the  proper  value 
for  g(u),  namely,  1,  Cj  should  be  the  reciprocal  of 

a\[N 

f  e'^dt. 

-bi/N 


Thus,  instead  of  simple  constants  for  cY  and  c2,  values  that  depend  on  N  are 
chosen.  This  leads  to  a  consideration  of  the  sequence  of  functions  {  g^(u)}, 
where 


for 


-bVN  <  u<  a\[N  , 


and 


1  for  u  >a^!7 
0  for  u  <  -b\JN. 


Because 


lim  Nk 
N-*oo 


J  e~  dt  =  0 

x\ffT 


for  any  positive  numbers  k  and  x,  it  follows  that  this  sequence  of  functions  also 
satisfies  the  fundamental  difference  equation  for  PN(u)  to- within  terms  of  order 
1/N ,  just  as  did  g(u). 

Third  Part 

According  to  the  preview  given  earlier  in  this  section  all  that  remains  is 
to  prove  that  LN  Ln_i  .  . .  L*+1  (P*  -  g*)  approaches  zero  as  N  becomes  infinite. 
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Let,  as  before,  l>k  denote  P4  -  gk.  Now  hk  is  a  function  of  one  real  variable, 
defined  at  the  values  of  u  corresponding  to  the  lattice  points  on  the  line  m  +  n  =  k. 
Specifically,  hk  is  defined  at  all  the  numbers  [-bk  +  /  (a  +  b)]/VF,  for  j  =  0,  ±  1,  ±2, 
....  However  (and  fortunately),  it  is  zero  at  most  of  these  points.  In  fact  it  is 
zero  at  every  point  except  possibly  the  points  [-bk  +/ (a  +  b)]/viT  for/ =1,2.  .  .  ., 

R  - 1.  This  follows  from  the  way  in  whick  was  made  to  obey  the  boundary  con¬ 
ditions  of  Eq.  6. 

Let  D4>iV  denote  LN LN-1  .  .  .  Li+1  hk.  Since  Dk  N  =  LNDktN_lf  it  is  ar 
that  Dkjij  is,  for  N  >fe,  a  solution  of  the  fundamental  difference  equation  (Eq.  3). 
Consequently  the  value  of  D4  N  at  any  number  u  is  a  certain  linear  combination 
of  the  values  of  hA,and  since  most  of  these  are  zero  they  can  be  written 


k~l 


/”  1 


Fi,kMu)  hk 


-bk  +  j  (a  +  b) 


The  point  to  be  demonstrated  is  that  Fy&y(u)  approaches  zero  as  N  becomes 
infinite. 

As  a  first  step  it  is  proposed  to  show  that,  if 


then 


Gk  N(u)  =  lub  F..Ju), 

k'N  (all/)  >'k’N 
Gk,N ^  1  Gk+l,N^’ 


To  this  end  observe  that 

Fj,k,N{u)  =  «A  +  ilUa  +  b)j  -b(k  +  1  )]/v/m  I  F  I  [(a  +  b)j  -  b(k  +  Dl/v/mi 


(21) 


so  that 


+  /^+1  \[(a  +  b)(j  +  1)  -  b(k  +  l)]/vTTT|  F  | [(a  +  b)(j  +  1)  -  b(k  +  1  )]/V*  +11, 


^  («*♦!*((»  + «/  -  b(k  +  Dwrm + jsA+1 1  ua + 


b)v  + 1)  -  b(k  +i)]/v/rm)cA+li/v(u). 


But,  by  Eq.  9  the  coefficient  of  Gk+l  N(u)  is  no  greater  than  one.  [If  the 
coefficients  here  seem  remote  from  Eq!  9  transform  the  coefficients  here  into 
o(i,k  -  j  +  1)  and  /3(/  +  l,fe  -j)  by  using  Eq.  11;  then  apply  Eq.  9.]  Thus,  for  every 
i,  F;,k,N(u )  -  Gk+1,»M>  and  from  this»  it  follows  that  GhN(u)<  Gk+l>N(u). 

Next  it  is  proposed  to  show  that  Gr  ^,(u)  can  be  made  arbitrarily  small, 
for  fixed  u,  simply  by  taking  r  and  N  -  r  sufficiently  large. 

It  is  obvious  from  Eq.  14  that  ar  (u)  will  be  sensibly  constant  on  any  bounded 
closed  interval,  if  r  is  sufficiently  large.  Hence,  as  a  little  reflection  will  show, 
Gr  N  (u)  will  be  nea;  the  maximum  term  in  a  binomial  distribution  involving  N  -  r 
trials  with  probability  p  of  success  in  any  one  trial.  But  from  the  classical 
theorem  of  De  Moivre  and  Laplace  this  is  approximately  l/vr[27r(N  -  r)  pq],  which 
is  arbitrarily  small  if  N-  r  is  sufficiently  large.  This  shows  that  GrN(u)  can  be 
made  arbitrarily  small  by  taking  r  and  N  -  r  sufficiently  large. 

From  the  two  stages  of  this  discussion  it  now  follows  that  .  .  . 

L/l+i  (PA  -  qk)  approaches  zero  as  N  becomes  infinite. 

_The  foregoing  proof  shows  that  Dk  N  approaches  zero  at  least  as  fast  as 
im.  It  may  in  fact  approach  zero  even  faster,  as  the  following  example  shows. 

In  the  event  that  o(m,n)  is  not  constant  but  actually  increases  with  increasing 
in  (and  decreases  with  increasing  n),  the  total  contribution  from  any  value  of  P*  - 
gk  to  all  points  on  the  line  m  +  n  =  fe  +  1  is  less  than  one.  Thus,  if  a1  m,n)  =  m/(m  +  n), 
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the  total  contribution  of  any  point  on  the  line  m  +n  =  k  to  all  points  on  the  line  m  +n  = 
k+  1  is  k/(k  +  1).  By  induction  the  total  contribution  of  this  point  on  m  +n  =  k  to  all 
points  on  m+ =  N  is  k/N .  Consequently  in  this  case  D kN  approaches  zero  at  least 
as  fast  as  1/N. 

Summary 

In  this  section  the  probability  that  the  first  side  wins  in  generalized  Lan- 
chester  combat  has  been  studied.  With  m  combatants  initially  on  the  first  side, 
and  n  combatants  initially  on  the  second  side  the  probability  that  the  first  side 
wins  is  given  approximately  by 


/  f  r  ~ 

|v^  e_a,*A  J 

1  [  it 

-b\J m+n  _  / 

where  a  and  b  are  constants  to  be  determined  by  the  relative  effectivenesses  of 
the  two  sides.  A  simpler  form  of  the  approximation,  but  which  does  not  give 
quite  so  accurate  results,  is 


1 

sPIn 


S-X'2  it. 


(23) 


In  the  particular  case  of  the  classical  Lanchester  theory,  where  am/(am  +  /3n) 
gives  the  probability  that  the  next  casualty  will  occur  on  the  second  side,  it  is 
seen  that 


or,  more  closely, 


i  /  3  yjttrn  - \ffin 

1 

1 

r  Vft 

e it 

/ 

it 

U 

’  va 

/ 

v« 

approximates  the  probability  that  the  first  side  wins. 


(24) 


(25) 
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CLASSICAL  THEORY  AND  NUMERICAL  EXAMPLES 


This  section  is  devoted  to  two  somewhat  separate  topics.  The  first  is  the 
computation  of  0(m,n)  and  a(m,n)  for  the  situation  described  in  the  manuscript 
by  Lee  and  Harrison.16  It  will  be  shown  that  the  probabilities  of  casualties  in 
the  type  of  combat  discussed  there  satisfy  Eq.  20,  so  that  the  probability  that 
Force  1  wins  is  given  exactly  by  Eq.  8,  and  approximately  by  Eq.  24  or  25.  The 
second  topic  is  the  comparison,  in  some  numerical  examples,  of  the  exact  and 
approximate  probabilities  in  the  unfavorable  circumstances  that  both  m  and  n 
are  small. 

The  situation  considered  by  Lee  and  Harrison  is  the  following  (with  slight 
changes  of  notation) : 

Two  forces  are  engaged  in  combat.  Force  1  consists  initially  of  m  weapons, 
Force  2  consists  initially  of  n  weapons.  Each  weapon  of  Force  1  fires  at  random 
according  to  the  Poisson  process  with  a  mean  firing  rate  of  X  shots  per  unit  time. 
Each  weapon  of  Force  2  fires  with  a  mean  firing  rate  of  p  shots  per  unit  time. 
The  single-shot  kill  probability  of  a  weapon  on  Force  1  firing  at  a  weapon  on 
Force  2  is  p ;  the  single -shot  kill  probability  of  a  weapon  on  Force  2  firing  at  a 
weapon  on  Force  1  is  q  .  The  engagement  is  continued  until  all  weapons  of  one 
force  or  the  other  are  destroyed. 

For  a  discussion  of  the  Poisson  process  the  reader  is  referred  to  Feller.17 
By  the  well-known  properties  of  the  Poisson  process  it  follows  that,  if  there  are 
m  combatants  in  Force  1  each  firing  at  a  rate  of  X  shots  per  minute,  then  the 
probability  that  Force  1  fires  exactly  j  shots  during  a  time  interval  of  t  minutes 
is 

•  t(mXtV'//!]  . 

Since  1  -  p  is  the  probability  that  any  one  of  these  shots  produces  no  casualty,  the 
probability  that  all  j  shots  miss  is  (1  -  p)> .  Hence,  the  probability  that  Force  1 
fires  exactly  j  shots  and  all  miss  is 

e-mkt  ,  [mXO  —  p)t]i/j  \  . 

Therefore  the  probability  that  Force  1  produces  no  casualties  (regardless  of  the 
number  of  shots  fired)  during  the  time  interval  of  t  minutes  is 

w 

£  e-mAt.[mX(i-P)t]'7/!, 

/*o 


or 

.-m\t  _mX(l-p)t 

c  •  e  f 

which  equals  e~mAp'. 

Thus  it  has  now  been  shown  that,  if  there  are  m  members  in  Force  1,  the 
probability  of  no  casualties  among  Force  2  during  a  time  interval  of  length  t 
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minutes  is  e~m‘Kpt  .  In  the  same  way  if  there  are  n  members  in  Force  2  the 
probability  of  no  casualties  among  Force  1  during  a  time  interval  of  length  l 
minutes  is  e~npl>L. 

Consequently  the  probability  that,  with  m  members  on  the  first  side  and  n 
members  on  the  second  side,  there  are  no  casualties  on  either  bide  during  a 
time  interval  of  length  t  is  exp  [-(mXp  +  npq)  t].  Comparison  of  this  with  Eq.  2 
shows  that  0(in,n)=  mXp+n/uq. 

The  problem  now  is  to  compute  a(m,n),  which,  it  will  be  recalled  from  the 
first  section,  means  the  probability  that  if  a  casualty  does  occur  it  will  occur  on 
the  second  side. 

The  probability  of  no  casualties  among  Force  1  and  at  least  one  casualty 
among  Force  2  during  an  interval  of  t  minutes  is 

_e-mXP‘). 

The  derivative  of  this,  for  t  =  0,  is  mXp.  But,  according  to  the  general  theory  de¬ 
veloped  in  the  first  section,  this  must  also  be 

C((m,n)  •  ). 


Hence, 

a(m,n)  =  mXpAmXn  +  npq). 

Similarly, 

=  npq/(mXp  +  npq )  . 

In  summary  then  it  has  been  shown  that  a(m,n)  and  (3(m,n)  satisfy  Eq.  20, 
with  the  constants  a  and  0  equal  to  Xp  and  pq,  respectively.  Therefore  the  prob¬ 
ability  of  winning  in  this  type  of  combat  can  be  computed  as  the  solution  of  Eq. 

5,  with  the  coefficients  given  by  Eq.  20.  The  explicit  solution  given  by  Eq.  8  can 
be  used,  or,  the  probabilities  can  be  obtained  successively  from  Eq.  5  directly. 
This  latter  method  is  the  one  actually  used  in  the  manuscript  of  Lee  and  Harrison 
previously  cited. 

It  is  clear  that  the  function  given  in  Eq.  8  depends  on  the  ratio  of  a  to  /3, 
and  not  on  their  separate  values.  Let  C  =  a// 3.  For  various  values  of  C,  the 
values  of  P(m,n),  rounded  to  three  decimal  places,  are  tabulated  in  the  manu¬ 
script  by  Lee  and  Harrison.  These  values  have  been  rounded  to  two  decimal 
places  and  provide  the  basis  for  the  “exact”  figures  in  Tables  1  and  2.  For 
Table  3,  the  “exact”  data  are  computed  directly  from  Eq.  5,  with  C  =9. 

Tables  1,  2,  and  3  provide  a  comparison  between  these  exact  values  of 
P(m,n)  and  the  approximate  values  of  P(m,n)  found  by  using  Eq.  24  for  C  =  1, 

C  =  2,  and  C  =  9.  The  comparisons  are  made  only  in  the  most  unfavorable  cir¬ 
cumstances,  when  both  m  and  n  are  small.  It  is  clear  from  the  tables  that,  as 
soon  as  :a  +  n  =  7  or  so,  Eq.  24  should  be  applicable  instead  of  Eq.  8  for  all  ordi¬ 
nary  purposes  of  computation,  provided  that  l/9<  C^9.  This  agreement  between 
the  exact  and  approximate  values  of  P(m,n)  seems  to  be  surprisingly  good. 

In  the  tables  the  lower  entry  in  each  pair  of  lines  is  the  exact  value  of 
P(m,n),  rounded  to  two  decimal  places.  The  upper  entry  in  each  square  is  the 
approximate  value,  computed  by  Eq.  24,  also  rounded  to  two  decimal  places. 
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Table  1 


Table  2 


COMPARISON  OF  EXACT 
AND  APPROXIMATE  VALDES3 
OF  P(m,n)  FOR  C  =  1 


COMPARISON  OF  EXACT 
AND  APPROXIMATE  VALUES3 
OF  P(m,n)  FOR  C  =2 


aThc  upper  figure  in  each  pair  of  lines  is  the 
approximation,  computed  from  Eq.  24.  The  lower 
figure  is  the  exact  value,  computed  from  Eq.  8. 
All  data  are  rounded  to  two  decimal  places. 


3The  upper  figure  in  each  pair  of  lines  is  the 
approximation,  computed  from  Eq.  24.  The  lower 
figure  is  the  exact  value,  comn"*ed  Eq.  8. 
All  data  arc  rounded  to  two  ■'.icimi !  places. 


Table  3 

COMPARISON  OF  EXACT 
AND  APPROXIMATE  VALUES3 
OF  P(m,n)  FOR  C  =  9 


uTlic  upper  figure  in  each  pair  of  lines  is  the 
approximation,  computed  from  Eq.  24.  The  lower 
figure  is  the  exact  value,  computed  from  Eq.  8. 
All  data  are  rounded  to  two  decimal  places. 
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